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Abstract 

We study scheduling problems motivated by recently developed techniques for micro- 
processor thermal management at the operating systems level. The general scenario can 
be described as follows. The microprocessor's temperature is controlled by the hardware 
thermal management system that continuously monitors the chip temperature and au- 
tomatically reduces the processor's speed as soon as the thermal threshold is exceeded. 
Some tasks are more CPU-intensive than other and thus generate more heat during ex- 
ecution. The cooling system operates non-stop, reducing (at an exponential rate) the 
deviation of the processor's temperature from the ambient temperature. As a result, the 
processor's temperature, and thus the performance as well, depends on the order of the 
task execution. Given a variety of possible underlying architectures, models for cooling 
and for hardware thermal management, as well as types of tasks, this scenario gives rise 
to a plethora of interesting and never studied scheduling problems. 

We focus on scheduling real-time jobs in a simplified model for cooling and thermal 
management. A collection of unit-length jobs is given, each job specified by its release 
time, deadline and heat contribution. If, at some time step, the temperature of the system 
is t and the processor executes a job with heat contribution h, then the temperature at 
the next step is (r + h)/2. The temperature cannot exceed the given thermal threshold 
T. The objective is to maximize the throughput, that is, the number of tasks that meet 
their deadlines. We prove that, in the offline case, computing the optimum schedule is 
NP-hard, even if all jobs arc released at the same time. In the online case, we show a 
2-competitive deterministic algorithm and a matching lower bound. 

1 Introduction 

Background. The problem of managing the temperature of processor systems is not new; 
in fact, the system builders had to deal with this challenge since the inception of computers. 
Since early 1990s, the introduction of battery-operated laptop computers and sensor systems 
highlighted the related issue of controlling the energy consumption. 

Most of the initial work on these problems was hardware and systems oriented, and 
only during the last decade substantial progress has been achieved on developing models 
and algorithmic techniques for microprocessor temperature and energy management. This 
work proceeded in several directions. One direction is based on the fact that the energy 
consumption is a fast growing function of the processor speed (or frequency) . Thus we can save 
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energy by simply slowing down the processor. Here, algorithmic research focussed on speed 
scaling - namely dynamically adjusting the processor speed over time to optimize the energy 
consumption while ensuring that the system meets the desired performance requirements. 
Another technique (applicable to the whole system, not just the microprocessor) involves 
power-down strategies, where the system is powered-down or even completely turned off when 
some of its components are idle. Since changing the power level of a system introduces some 
overhead, scheduling the work to minimize the overall energy usage in this model becomes a 
challenging optimization problem. 

Models have also been developed for the processor's thermal behavior. Here, the main 
objective is to ensure that the system's temperature does not exceed the so-called thermal 
threshold, above which the processor cannot operate correctly, or may even be damaged. 
In this direction, techniques and algorithms have been proposed for using speed-scaling to 
optimize the system's performance while maintaining the temperature below the threshold. 

We refer the reader to the survey by Irani and Pruhs |5], and references therein, for more 
in-depth information on the models and algorithms for thermal and energy management. 

Temperature-aware scheduling. The above models address energy and thermal man- 
agement at the micro-architecture level. In contrast, the problem we study in this paper 
addresses the issue of thermal management at the operating systems level. Most of the previ- 
ous work in this direction focussed on multi-core systems, where one can move tasks between 
the processors to minimize the maximum temperature [91 [T] |2l [61 |HJ HI [10] . However, as it 
has been recently discovered, even in single-core systems one can exploit variations in heat 
contributions among different tasks to reduce the processor's temperature through appro- 
priate task scheduling [U HI El EI]- In this scenario, the microprocessor's temperature is 
controlled by the hardware dynamic thermal management (DTM) system that continuously 
monitors the chip temperature and automatically reduces the processor's speed as soon as 
the thermal threshold (maximum safe operating temperature) is exceeded. Typically, the 
frequency is reduced by half, although it can be further reduced to one fourth or even one 
eighth, if needed. Running at a lower frequency, the CPU generates less heat. The cooling 
system operates non-stop, reducing (at an exponential rate) the deviation of the processor's 
temperature from the ambient temperature. Once the chip cools down to below the threshold, 
the frequency is increased again. 

Different tasks use different microprocessor units in different ways; in particular, some 
tasks are more CPU- intensive than other. As a result, the processor's thermal behavior - 
and thus the performance as well - depends on the order of the task execution. In particular, 
Yang et al. [llj point out that, based on the standard model for the microprocessor thermal 
behavior, for any given two tasks, scheduling the "hotter" job before the "cooler" one, results 
in a lower final temperature than after the reverse order. They take advantage of this phe- 
nomenon to reduce the number of DTM invocations, thus improving the performance of the 
OS scheduler. 

With multitudes of possible underlying architectures (for example, single- vs. multi-core 
systems), models for cooling and hardware thermal management, as well as types of jobs 
(real-time, batch, etc.), the scenario outlined above gives rise to a plethora of interesting and 
never yet studied scheduling problems. 
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Our model. We focus on scheduling real-time jobs in a somewhat simplified model for 
cooling and thermal management. The time is divided into unit time slots and each job has 
unit length. (These jobs represent unit slices of the processes present in the OS scheduler's 
queue.) We assume that the heat contributions of these jobs are known. This is counterintu- 
itive, but reasonably realistic, for, as discussed in [TT], these values can be well approximated 
using appropriate prediction methods. 

In our thermal model we assume, without loss of generality, that the ambient temperature 
is and that the heat contributions are expressed in the units of temperature (that is, by how 
much they would increase the chip temperature in the absence of cooling). In reality jllj . 
during the execution of a job, its heat contribution is spread over the whole time slot and so is 
the effect of cooling; thus, the final temperature can be expressed using an integral function. 
In this paper, we use a simplified model where we first take into account the job's heat 
contribution, and then apply the cooling, where the cooling simply reduces the temperature 
by half. 

Finally, we assume that only one processor frequency is available. Consequently, if there 
is no job whose execution does not cause a thermal violation, the processor must stay idle 
through the next time slot. 

Our results. Summarizing, our scheduling problem can be now formalized as follows. A 
collection of unit-length jobs is given, each job j with a release time Vj, deadline dj and heat 
contribution hj. If, at some time step, the temperature of the system is r and the processor 
executes a job j, then the temperature at the next step is (r + hj)/2. The temperature 
cannot exceed the given thermal threshold T. The objective is to compute a schedule which 
maximizes the number of tasks that meet their deadlines. 

We prove that in the offline case computing the optimum schedule is NP-hard, even if all 
jobs are released at the same time and have equal deadlines. In the online case, we show a 
2-competitive deterministic algorithm and a matching lower bound. 

2 Terminology and Notation 

The input consists of n unit-length jobs that we number 1,2, ... ,n. Each job j is specified 
by a triple (rj,dj,hj) G N x N x Q, where rj is its release time, dj is the deadline and hj is 
its heat contribution. The time is divided into unit-length slots and each job can be executed 
in any time slot in the interval [rj,dj\. By t u we denote the processor temperature at time 
u. The initial temperature is To = 0, and it changes according to the following rules: if 
the temperature of the system at a time u is t u and the processor executes a job j then 
the temperature at time u + 1 is t u+ \ = (t u + hj)/2. The temperature cannot exceed the 
given thermal threshold T. Without loss of generality, we assume that T = 1. Thus if 
(r u + hj)/2 > 1 then j cannot be executed at time u. Idle slots are treated as executing a job 
with heat contribution 0, that is, after an idle slot the temperature decreases by half. 

Given an instance, as above, the objective is to compute a schedule with maximum 
throughput, where throughput is defined as the number of completed jobs. Extending the 
standard notation for scheduling problems, we denote the offline version of this problem by 
l\ri,hi,pi = l\^Ui. 

In the online version, denoted l|online-rj, hi,pi = 1| ]P Ui, jobs are available to the algo- 
rithm at their release time. Scheduling decisions of the algorithm cannot depend on the jobs 
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that have not yet been released. 

Example. Suppose we have four jobs, specified in notation j — ► (rj,dj, hj): 1 — > (0, 2, 0.4), 
2 -» (0,4,0.6), 3 -> (2,3,1.9), 4 -> (4,6,0.8). 



0.0 0.2 0.4 0.2 0.5 0.25 

Schedule 1 1 | 2 | - | 4 | - | 

0.0 0.2 0.1 1.0 0.8 0.8 

Schedule 2 1 | - | 3 | 2 | 4 | 

Figure 1: Example of two schedules. 

Figure [T] shows these jobs and two schedules. Numbers above the schedules denote tem- 
peratures. In the first schedule, when we schedule job 2 at time 1, the processor is too hot to 
execute job 3, so it will not complete job 3 at all. In the second schedule, we stay idle in step 
2, allowing us to complete all jobs. 

3 The NP-Completeness Proofs 

In this section we prove that the scheduling problem l\r{,hi,pi = 1| ^ Ui is NP-hard. For the 
sake of exposition, we start with a proof for the general case, and later we give a proof for 
the special case when all release times and deadlines are equal. 

Theorem 1 The offline problem 1 1 , hi,pi = 1| Ui is NP-hard. 

Proof: We use a reduction from the 3-Partition problem, defined as follows: we are given a 
set S of 3n positive integers a\, . . . , a% n such that 0/4 < ai < (3/2 for all i, where (3 = - ^ a^. 
The goal is to partition S into n subsets, each subset with the total sum equal exactly j3. (By 
the assumption on the di, each subset will have to have exactly 3 elements.) This partition 
of S will be called a 3-partition. 3- Partition is well-known to be NP-hard [3] in the strong 
sense, that is, even if maxj < p(n), for some polynomial p{n). 

We now describe the reduction. For the given instance of 3-Partition we construct an 
instance of l|r,, hi,pi = 1| ^ Ui with 4n jobs. These jobs will be of two types: 

(i) First, we have 3n jobs that correspond to the instance of 3-Partition. For every 

1 < i < 3n we create a job of heat contribution 2 — 2 1 ~ a,; , release time 1 and deadline 
n((3 + l). 

(ii) Next, we create n additional "gadget" jobs. These jobs are tight, meaning that their 
deadline is just one unit after the release time. The first of these jobs has heat contri- 
bution 2 and release time 0. Then, for each 1 < j < n — 1, we have a job with heat 
contribution 1 and release time j{(5 + 1). 
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We claim that S has a 3-partition if and only if the instance of l|rj, hi,pi = 1| Ui constructed 
above has a schedule with throughput 4n (that is, with all jobs completed). 

The main idea is this: Imagine that at some moment the temperature is 1, and we want 
to schedule a job of heat contribution 2 — 2 1 ~ x , for some integer x > 1. Then we must 
first wait x — 1 time units, so that the processor cools down to (\) x ~ l = 2 l ~ x , before we 
can schedule that job, and then right at the completion of this job the temperature is 1 
again. The analogous property holds, in fact, even if at the beginning the temperature was 
some r > 1/2, except that then, after completing this job, the new temperature will be 
T > = ( T 2 1 - x + 2 - 2 1 ' x )/2 > (2 1 ~ x /2 + 2- 2 l ~ x )/2 = 1 - 2~ x ' 1 > 1/2, that is, it may be 
different than r but still strictly greater than 1/2. With this observation in mind, the proof 
of the above claim is quite easy. 

(<=) First we show that if there is a solution to the scheduling problem, then 5 has a 
3-partition. Note that the tight jobs divide the time into n intervals of length (3 each. Also 
each of the 3n other jobs is scheduled in exactly one of these intervals. This defines a partition 
of S into n sets. 

Now we claim that after every job execution the temperature is strictly more than 1/2. 
This is true for the first job to be scheduled, since it has heat contribution 2. Each other job 
in the instance, including the tight jobs, has heat contribution at least 1. Therefore right after 
its execution the temperature is at least 1/2, already if we take only this job into account. 
But there is also a declining but non-zero temperature contribution from the first tight job. 
So overall the temperature after every execution is strictly more than 1/2. 

Together with the earlier observation, this implies that every non-tight job of heat contri- 
bution 2 x ~ a% must be preceded by ai — 1 idle units, thus using a« time slots in total. Therefore 
every set in the above mentioned partition has the total sum at most (3. Since there are at 
most n sets in the partition, the total sum of each must be exactly (3. This proves that S has 
a 3-partition. 

(=^>) Now we show the other implication, namely that if S has a 3-partition then there is a 
solution to the scheduling instance. Simply schedule the tight jobs at their release time. This 
divides the time into n intervals of length (3 each. Assign each of the n sets in the partition to 
a distinct interval and schedule its jobs in this interval: every integer en in the set corresponds 
to a job of heat contribution 2 — 2 1_ai , and we schedule it preceded with en — 1 idle time 
units. The jobs of the set can be scheduled in an arbitrary order, the important property 
being that, since their total sum is (3, they all fit exactly in this interval. After all jobs in 
one set are executed the temperature is exactly 1, and during the execution the temperature 
does not exceed 1 (because we pad the schedule with enough idle slots). All release time and 
deadline constraints are satisfied, so the scheduling instance has a feasible solution as well. 

It remains to show that the above instance of l\ri,hi,pi = 1|X^^» can be produced in 
polynomial time from the instance of 3-Partition. Indeed, every number aj is mapped to 
some number 2 — 2 1_a % which is described with O(aj) bits. Since we assumed that all numbers 
aj for 1 < i < n are polynomial in n, the reduction will take polynomial time, and the proof 
is complete. □ 

The above construction used the constraints of the release times and deadlines to fix tight 
jobs that force a partition of the time into intervals. We can actually prove a stronger result, 
namely that the problem remains N P-complete even if all release times are and all deadlines 
are equal. Why is it interesting? One common approach in designing on-line algorithms for 
scheduling is to compute the optimal schedule for the pending jobs, and use this schedule to 
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make the decision as to what execute next. The set of pending jobs forms an instance where 
all jobs have the same release time. Our NP-hardness result does not necessarily imply that 
the above method cannot work (assuming that we do not care about the running time of the 
online algorithm), but it makes this approach much less appealing, since reasoning about the 
properties of the pending jobs is likely to be very difficult. 

Theorem 2 The offline problem l|rj, hi,Pi = 1| ^ Ui is strongly NP-hard even for the special 
case when jobs are released at time and all deadlines are equal. 

Proof: The reduction is from Numerical-3D-Matching. In this problem, we are given 
3 sets A,B,C of n non-negative integers each and a positive integer (3. A 3-dimensional 
numerical matching is a set of n triples (a,b,c) G A x B x C such that each number is 
matched exactly once (appears in exactly one triple) and all triples (a, b, c) in the set satisfy 
a + b + c = (3. Numerical-3D-Matching is known to be NP-complete even when the values 
of all numbers are bounded by a polynomial in n, it is referenced as problem [SP16] in [3 . 
(Clearly, this problem is quite similar to 3- Partition that we used in the previous proof.) 
Without loss of generality, we can assume (Al) that every x S A U B U C satisfies x < (3 

and (A2) that Y, xe AuBuc x = $ n - 

We now describe the reduction. Let be the constant a = 1/25 and the function / :xh 
a(l + x/8/3). The instance of 1 1 , hi,pi = 1| ^ Ui will have 4n + 1 jobs, all with release time 
and deadline 4n + 1. These jobs will be of two types: 

(i) First we have 3n jobs that correspond to the instance of Numerical-3D-Matching: 

for every a S A, there is a job of heat contribution 8/(a), for every b S B, there is a 
job of heat contribution 4/(6), and for every c € C, there is a job of heat contribution 
2/(c). We call these jobs, respectively, A-jobs, B-jobs and C-jobs. 

(ii) Next, we have n + 1 "gadget" jobs. The first of these jobs has heat contribution 2, and 

the remaining ones 1.75. We call these jobs, respectively, 2- and 1.75-jobs. 

We claim that the instance A, B, C, [3 has a numerical 3-dimensional matching if and 
only if the instance of l\ri,hi,pi = 1| Yl Ui that we constructed has a schedule with all jobs 
completed not later than at time 4n + 1 . 

The idea of the proof is that the gadget jobs are so hot that they need to be scheduled 
only every 4-th time unit, separating the time into n blocks of 3 consecutive time slots each. 
Every remaining job has a heat contribution that consists of two parts: a constant part (8a, 
4a or 2a) and a tiny variable part that depends on the instance of the matching problem. 
This constant part is so large that in every block there is a single ^4-job, a single -B-job and a 
single C-job, and they must be scheduled in that order. This defines a partitioning of A, B, C 
into triplets of the form (a, b, c) S A x B x C. Since the gadget jobs are so hot, they force 
every triple (a, 6, c) to satisfy a + b + c < (3. We now make this argument formal. 

(=^) Suppose there is a solution to the instance of Numerical-3D-Matching. We con- 
struct a schedule where all jobs complete at or before time 4n + 1. Schedule the 2-job at time 
0, and all other gadget jobs every 4-th time slot. Now the remaining slots are grouped into 
blocks consisting of 3 consecutive time slots each. For i = 1, 2, . . . , n, associate the z-th triple 
(a, b, c) from the matching with the i-th block, and the corresponding A-,B- and C-jobs are 
executed in this block in the order A,B,C — see Figure [2] 
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Figure 2: The structure of the schedules in the proof. 



By construction, every job meets the deadline, so it remains to show that the temperature 
never exceeds 1. The non-gadget jobs have all heat contribution smaller than 1, by assumption 
(Al), so execution of a non-gadget job cannot increase the temperature to above 1, as long 
as the temperature before was not greater than 1. 

Now we show by induction that right after the execution of a gadget job the temperature 
is exactly 1. This is clearly the case after execution of the first job, since its heat contribution 
is 2. Now let u be the time when a 1.75-job is scheduled, and suppose that at time u — 3 
the temperature was 1. Let (a, b, c) be the triple associated with the block consisting of time 
slots between u — 3 and u. Then, by a + b + c = /?, at time u the temperature is 

1 8/(a) 4/(6) 2/(c) 1 / a + b + c \ 1 (24 1\ 1 
8 + — + — + — = 8 + H +_ J = 8 +a U + 8j = I" 

This shows that at time u + 1, after scheduling a 1.75-job, the temperature is again (1.75 + 
1/4) /2 = 1. We conclude that the schedule is feasible. 

(•^=) Now we show the remaining implication. Suppose the instance of l\ri, hi,pi = 1| ^ Ui 
constructed above has a schedule where all jobs meet the deadline 4n + 1. We show that there 
exists a matching of A, B, C. 

We first show that this schedule must have the form from Figure [2] First, note that since 
all 4n + 1 jobs have deadline 4n + 1, all jobs must be scheduled without any idle time between 
time and 4n + 1. This means that the gadget job of heat contribution 2 must be scheduled 
at time 0, because that is the only moment of temperature 0. Also note that every job has 
heat contribution at least 2/(0) = 2a. 

Now we claim that all 1.75-jobs have to be scheduled every 4-th time unit. This holds 
because two units after scheduling a 1.75-job, the temperature is at least 

1.75 2a 2a 

_ + _ + _ >1/4 . 

Therefore two executions of 1.75-jobs must be at least 4 time units apart, and this is only 
possible if they are scheduled exactly at times 4i for i = 1, . . . , n. 

We claim that after every execution of a gadget job, the temperature is at least r = 
364/375. Clearly this is the case after the execution of the 2-job. Now assume that at time 
4i + 1, for some i = Q, . . . ,n — 1, the claim holds. Then at time 4i + 5, after the execution of 
the next 1.75-job, the temperature is at least 

r 2a 2a 2a 1.75 

16 + T6 + "8~ + T + ~2~ ~ T ' 

We show now that every block contains exactly one A-job, one £>-job and one C-job, in that 
order. Towards contradiction, suppose that some A-job is scheduled at the 2nd position of a 
block, say at time 4i + 2 for some i G {0, . . . , n — 1}. Its heat contribution is at least 8/(0). 
Therefore the temperature at time Ai + 4 would be at least 

r 2a Sa 2a _ . . 

8 + T + T + T >1/4 ' 
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contradicting that a 1.75-job is scheduled at that time: A similar argument shows that ^4-jobs 
cannot be scheduled at position 3 in a block, and therefore the 1st position of a block is always 
occupied by an ^4-job. 

By an analogous reasoning, we show that a B-job cannot be scheduled at the 3rd position 
of some block. It it were scheduled there, the temperature at the end of block would be at 
least 

r 8a 2a 4a 

8 + T + T + T >1/4 ' 

again contradicting that a 1.75-job is scheduled at the end of the block. 

We showed that every block contains jobs that correspond to some triple (a, b, c) £ A x 
B x C. It remains to show that each such triple satisfies a + b + c = 0. Let (aj, bi, q) be the 
triple corresponding to the i-th block, for 1 < i < n. 

Define to = 1 and 

U = ^■8/(a i ) + ^ -4/(^ + 1 -2/^) + ^ -1.75 

= ^m + iai + bi + ayp]. 

Thus ti represents the contribution of the ith block and a following gadget job to the temper- 
ature right after this gadget job. This implies that, for 1 < k < n, the temperature at time 
4k + 1 is exactly EiUl 1 / 16 )^^- B Y Assumption (A2), J27=i( a i + b i + q) = n(3, and thus 
J2i=l U = l6 n - 

Define Pi = U — 15/16 for i = 1,2, n. From the previous paragraph, 

n 

Ew = o. (i) 

i=l 

As mentioned earlier, X]i=o(Vl6) fe ~*ti is the temperature at time 4k+l, so we have X^=o(Vl6) fc_;i ti < 
1. Therefore, for all 1 < k < n we get 



lQ- k ^Wpi = lQ- k ^W{ti - 15/16) 



i=i i=i 



= E(V16)^- (1/16)* -15^(1/16) 

i=0 i=l 

< 1 - (l/16) fc - 15(1 - (l/16) fe )/15 = 
We conclude that for k = 1, 2, n we have 



E 16 >* ^ °- ( 2 ) 



i=i 

To complete the proof, it remains to show that pi = for all i, for this will imply that 
tti + bi + Ci = (3, which in turn implies that there is a matching. We prove this claim by 
contradiction. Suppose that not all p^s are zero. Let £ be the smallest index such that pi > 
and 

Pi + ...+Pt>0- (3) 
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Clearly, £ > 2. By the minimality of £, for every 2 < k < £ — lwe have 

PX + ...+ Pk-i < and p k + . . . + p e > 0. 
There are crj > 0, i = 1, ,such that Xrf=i ^ = Then 
£ I j i-i i 

^Wpj = ^^diPj = ^2<Ti^2pi + otvi > o, 

j = l j=l 1=1 i=l j=i 

because all terms are non-negative and pe > 0. This contradicts Q. 

It remains to show that the above instance of l\hi,pi = l\^2,Ui can be produced in 
polynomial time from the instance of Numerical-3D-Matching. Indeed, every number 
xGAuBuCis mapped to some fraction, where both the denominator and numerator 
are linear in x and (3. Therefore if we represent fractions by writing the denominator and 
numerator, and not as some rounded decimal expansion, the reduction will take polynomial 
time, and the proof is complete. □ 

Theorem [2] implies that other variants of temperature-aware scheduling are N P-hard as 
well. Consider for example the problem l\hi,pi = l|C max , where the objective is to minimize 
the makespan, that is, the maximum completion time. In the decision version of this problem 
we ask if all jobs can be completed by some given deadline C - which is exactly what we 
proved above to be N P-hard. It also gives us NP-hardness of l\hi,pi = 1| Cj- To prove 
this, we can use the decision version of this problem where we ask if there is a schedule for 
which the total completion time is at most n(n — l)/2 (where n is the number of jobs). 



4 An Online Competitive Algorithm 

In this section we show that there is a 2-competitive algorithm for 1 1 online- rj, hi,Pi = 1| U{. 
We will show, in fact, that a large class of deterministic algorithms is 2-competitive. 

Given a schedule, we will say that a job j is pending at time u if j is released, not expired 
(that is, rj<u< dj) and j has not been scheduled before u. If the temperature at time u is 
t u and j is pending, then we call j admissible if r M + hj < 2, that is, j is not too hot to be 
executed. 

We say that a job j dominates a job k if j is both not hotter and has the same or smaller 
deadline than k, that is hj < hf. and dj < dk- If at least one of these inequalities is strict, then 
we say that j strictly dominates k. An online algorithm is called reasonable if at each step (i) 
it schedules a job whenever one is admissible (the non-waiting property), and, if there is one, 
(ii) it schedules an admissible job that is not strictly dominated by another pending job. The 
class of reasonable algorithms contains, for example, the following two natural algorithms: 

CoolestFirst: Always schedule a coolest admissible job (if there is any), breaking ties in favor 
of jobs with earlier deadlines. 

EarliestDeadlineFirst: Always schedule an admissible job (if there is one) with the earliest 
deadline, breaking ties in favor of cooler jobs. 

If two jobs have the same deadlines and heat contributions, both algorithms give preference 
to one of them arbitrarily. 
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Theorem 3 Any reasonable algorithm for l\online-ri, hi,pi = 1\^2 V{ is 2- competitive. 

Proof: Let A be any reasonable algorithm. We fix some instance, and we compare the 
schedules produced by A and the adversary on this instance. The proof is based on a charging 
scheme that maps jobs executed by the adversary to jobs executed by A in such a way that 
no job in A's schedule gets more than two charges. 
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Figure 3: Four types of charges. The vertical inequality signs between the schedules show the 
relation between the temperatures. 



We now describe this charging scheme. (See Figure|3]for illustration.) There will be three 
types of charges, depending on whether A is busy or idle at a given time step and on the 
relative temperatures of the schedules of A and the adversary. The temperature at time it in 
the schedules of A and the adversary will be denoted by t u and r' w respectively. 

Suppose that at some time u, A schedules a job k while the adversary schedules a job j, 
or that the adversary is idle (we treat this case as if executing a job j with hj = 0.) Then 
step u will be called a relative-heating step if k is strictly hotter than j, that is > hj. Note 
that if t v > t' v for some time v, then a relative-heating step must have occurred before time 
v. 

Consider now a job j scheduled by the adversary, say at time v. The charge of j is defined 
as follows: 

Type 1 Charges: If A schedules a job k at time v, charge j to k. Otherwise, we are idle, 
and then we have two more cases. 

Type-2 Charges: Suppose that A is hotter than the adversary at time v but not at v + 1, 
that is t u > t' u and t u+ \ < r' u+l . In this case we charge j to the job q executed by A in the 
last relative-heating step before v. (As explained above, this step exists.) 

Type-3 Charges: Suppose now that either A is not hotter than the adversary at v or A is 
hotter than the adversary at v + 1. In other words, t v < t' u or t v+ \ > t' v+1 . (Note that in the 
latter case we must also have t v > t' v as well, since the algorithm is idle.) 

We claim that r v + hj < 2, which means that neither j or any job I with hi < hj can 
be pending at v. To justify this, we consider the two sub-cases of the condition for type-3 
charges. If t v < t' v , the claim is trivial, since then T v + hj < T' u + hj < 2, because the adversary 
executes j at time v. So assume now that t v+ \ > t' v+1 . Since A is idle, we have t v+ \ < 1/2. 
Therefore hj = 2r^ +1 — t' v < 2t' v+1 < 1, and the claim follows because t v < 1 as well. 

From the above claim, j was scheduled by A at some time u < v. To find a job that 
we can charge j to, we construct a chain of jobs j,j',j", ■ ■ ■ , j* with strictly decreasing heat 
contributions. Further, all jobs in this chain except j* will be executed by A at relative- 
heating steps. This chain will be determined uniquely by j, and we will charge j to j*. If, 
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at time u, the adversary is idle or schedules an equal or hotter job, then j* = j, that is, we 
charge j to itself (its "copy" in A's schedule). Otherwise, if the adversary schedules j' at 
time u then j' is strictly cooler than j, that is hji < hj. Now we claim that the algorithm 
schedules j' at some time before v. Indeed, if f is scheduled before u, we are done. Otherwise, 
j' is pending at u, and, since the algorithm never schedules a dominated job, we must have 
df > dj > v + 1. By our earlier observation and by hy < hj, if A did not schedule j' before 
v, then j' would have been admissible at v, contradicting the fact A is idle at v. So now 
the chain is j,f. Let v! be the time when A schedules j'. If the adversary is idle at time 
v! or if j' is not hotter than the job executed by the adversary at time v! , we take j* = j' . 
Otherwise, we take j" to be the job executed by the adversary at time u' , and so on. This 
process must end at some point, since we deal with strictly cooler and cooler jobs. So the job 
j* is well-defined. 

This completes the description of the charging scheme. Now we show that any job sched- 
uled by A will get at most two charges. Obviously, each job in »4.'s schedule gets at most 
one type-1 charge. In-between any two time steps that satisfy the condition of the type-2 
charge there must be a relative-heating step, so the type-2 charges are assigned to distinct 
relative-heating steps. As for type-3 charges, every chain defining a type-3 charge is uniquely 
defined by the first considered job, and these chains are disjoint. Therefore type-3 charges 
are assigned to distinct jobs. 

Now let A; be a job scheduled by A at some time v. By the previous paragraph, k can get 
at most one charge of each type. We claim that k cannot get charges of each type 1, 2 and 3. 
Indeed, if k receives a type-1 charge, then the adversary is not idle at time v, and schedules 
some job I. If k also receives a type-2 charge, then v must be a relative-heating step, that is 
/ifc > hg. But to receive a type-3 charge, k would be the last job j* in a chain of some job j, 
and since the chain was not extended further, we must have h^ < hg. So type-1, type-2 and 
type-3 charges cannot coincide. 

Summarizing the argument, we have that every job scheduled by the adversary is charged 
to some job scheduled by A, and every job scheduled by A receives no more than 2 charges. 
Therefore the competitive ratio of A is not more than 2. □ 



5 A Lower Bound on the Competitive Ratio 

Theorem 4 Every deterministic online algorithm for l\online-ri,hi,pi = has com- 

petitive ratio at least 2. 

Proof: Fix some deterministic online algorithm A. We (the adversary) release a job 1 — > 
(0,3, 1.2) (in notation j — > (rj,dj,hj)). If A schedules it at time 0, we release at time 1 a 
tight job 2 — > (1,2,1.6) and schedule it followed by 1. A's schedule is too hot at time 1 to 
execute job 2. If A does not schedule job 1 at time 0, then we schedule it at and release at 
time 2 (and schedule) a tight job 3 — > (2,3, 1.6) at time 2. In this case, A cannot complete 
both jobs 1 and 3 without violating the thermal threshold. In both cases we schedule two 
jobs, while A schedules only one, completing the proof. (See Figure El) □ 
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Figure 4: The lower bound for deterministic algorithms. 

6 Final Comments 

Many open problems remain. Perhaps the most intriguing one is to determine the randomized 
competitive ratio for the problem we studied. The proof of Theorem [4] can easily be adapted 
to prove the lower bound of 1.5, but we have not been able to improve the upper bound of 2; 
this is, in fact, the main focus of our current work on this scheduling problem. One approach, 
based on Theorem [3] would be to randomly choose, at the beginning of computation, two 
different reasonable algorithms Ai, A%, each with probability 1/2, and then deterministically 
execute the chosen Ai. So far, we have been able to show that for many natural choices for 
A\ and A2 (say, CoolestFirst and EarliestDeadlineFirst), this approach does not work. 

Extensions of the cooling model can be considered, where the temperature after executing 
j is (t + hj) I R, for some R > 1. Even this formula, however, is only a discrete approximation 
for the true model (see, for example, [H]), and it would be interesting to see if the ideas 
behind our 2-competitive algorithm can be adapted to these more realistic cases. 

In reality, thermal violations do not cause the system to idle, but only to reduce the 
frequency. With frequency reduced to half, a unit job will execute for two time slots. Several 
frequency levels may be available. 

We assumed that the heat contributions are known. This is counter-intuitive, but not 
unrealistic, since the "jobs" in our model are unit slices of longer jobs. Prediction methods 
are available that can quite accurately predict the heat contribution of each slice based on 
the heat contributions of the previous slices. Nevertheless, it may be interesting to study a 
model where exact heat contributions are not known. 

Other types of jobs may be studied. For real-time jobs, one can consider the case when not 
all jobs are equally important, which can be modeled by assigning weights to jobs and maxi- 
mizing the weighted throughput. For batch jobs, other objective functions can be optimized, 
for example the flow time. 

One more realistic scenario would be to represent the whole processes as jobs, rather then 
their slices. This naturally leads to scheduling problems with preemption and with jobs of 
arbitrary processing times. When the thermal threshold is reached, the execution of a job 
is slowed down by a factor of 2. Here, a scheduling algorithm may decide to preempt a job 
when another one is released or, say, when the processor gets too hot. 

Finally, in multi-core systems one can explore the migrations (say, moving jobs from hotter 
to cooler cores) to keep the temperature under control. This leads to even more scheduling 
problems that may be worth to study. 
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